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The successive emergence of highly pathogenic coronaviruses (CoVs) such as the Severe Acute Respiratory 
Syndrome (SARS-CoV) in 2003 and the Middle East Respiratory Syndrome Coronavirus (MERS-CoV) in 
2012 has stimulated a number of studies on the molecular biology. This research has provided significant 


Keywords: new insight into functions and activities of the replication/transcription multi-protein complex. The latter 
SARS eOuoneavettls directs both continuous and discontinuous RNA synthesis to replicate and transcribe the large coronavirus 
as genome made of a single-stranded, positive-sense RNA of ~30 kb. 


In this review, we summarize our current understanding of SARS-CoV enzymes involved in RNA bio- 
chemistry, such as the in vitro characterization of a highly active and processive RNA polymerase complex 
which can associate with methyltransferase and 3’-5’ exoribonuclease activities involved in RNA cap- 
ping, and RNA proofreading, respectively. The recent discoveries reveal fascinating RNA-synthesizing 


Proofreading 


machinery, highlighting the unique position of coronaviruses in the RNA virus world. 


1. Introduction 


The evolution rates of RNA viruses are amongst the high- 
est of all organisms, making phylogenetic comparison between 
viral families often challenging. The Nidovirales order consists of 
four families that display broadly diverse features: from genome 
length spanning from 13 to 32kb, to the set of encoded genes, 
or host range. Nonetheless, all nidoviruses share common char- 
acteristics such as the use of ribosomal frameshifting in order to 
translate a part of the nonstructural proteins, including the RNA- 
dependent RNA polymerase and the helicase. Moreover, they all 
code for a set of subgenomic (sg) MRNAs, and also for six essential 
enzyme/protein domains displaying the same linear organization 
along the genome: a 3C-like proteinase (3CLpro) flanked by 2 
transmembrane domains (TM), then the RNA-dependent RNA poly- 
merase (RdRp) and eventually a zinc-cluster binding domain (Zm) 
fused to a helicase belonging to superfamily 1 (HEL1). This specific 
protein organization can be summarized as follows: Nter-TM2- 
3CLpro-TM3-RdRp-(Zm)-HEL1-Cter. 
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The Coronaviridae together with the Roniviridae and the 
Mesoniviridae families are called “large-genome nidoviruses”, 
because their genome length spans from 20kb to 32 kb. In con- 
trast, viruses of the fourth family, the Arteriviridae, belongs to the 
so-called “small-genome nidoviruses”, since their genome length 
is between 13 and 16 kb (Gorbalenya et al., 2006). 

Among Nidovirales, only some coronaviruses (CoVs) are known 
to infect humans. Human coronaviruses mostly cause mild respira- 
tory infections. However, in 2003, anew coronavirus was identified 
to be responsible of a Severe Acute Respiratory Syndrome (SARS) 
(Stadler et al., 2003). SARS-CoV eventually spread across the globe, 
causing a total of around 8400 infections and with a case/fatality 
rate (CFR) of 10%. Studies following the pandemic showed that 
SARS-CoV is a zoonotic virus jumping from bats to humans upon 
undefined critical mutational events, and possibly through palm 
civets as intermediate hosts (Li et al., 2005; Wang and Eaton, 2007). 
Recently, the direct progenitor virus of SARS-CoV from bat has been 
isolated (Ge et al., 2013). This bat SARS-like CoV uses the human 
angiotensin converting enzyme II (ACE2) receptor and live viruses 
were recovered from Vero E6 cells (Ge et al., 2013). Hence, this 
virus does not require any mutations or intermediate hosts to infect 
human cells. 

A decade after the SARS crisis, a new coronavirus was identi- 
fied in the Middle East causing a form of pneumonia associated in 
some instances to kidney failure (De Groot et al., 2013; Zaki et al., 
2012). This coronavirus is now named Middle East Respiratory Syn- 
drome Coronavirus (MERS-CoV), and the epidemic is still ongoing 
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two years after its discovery, counting at present (July 2014) 837 
cases with the CFR of approximately 30%. MERS-CoV seems to be 
also of a zoonotic origin, as antibodies against MERS-CoV have been 
detected in camels (Perera et al., 2013; Reusken et al., 2013) anda 
case of camel-to-human transmission has been recently reported 
(Azhar et al., 2014). 

These two episodes confirm that few mutations allow zoonotic 
coronaviruses to jump the species barriers and threat human health 
worldwide. To date, no specific treatment against CoVs is avail- 
able. A better understanding about how coronaviruses replicate 
in the host is therefore of critical importance to develop antiviral 
strategies aiming at controlling their spread and morbidity. 

Coronavirus RNA replication and transcription are mediated 
by nonstructural proteins (nsps) encoded by two open reading 
frames, ORFla and ORF1b (Fig. 1). These two ORFs are translated 
in a cap-dependent manner to yield two overlapping polyproteins 
referred as ppla and pp1lab. ORF1b is translated only upon a ribo- 
somal frameshift (—1) near the 3’ end of ORF1a, with ~30% efficacy 
(Snijder et al., 2003). Hence, the nsps encoded in ORF1b (nsp12 to 
nsp16) are expressed at lower levels compared to ORF1la-encoded 
products (nsp1 to nsp11). After polyprotein maturation mediated 
by viral proteases, these nsps associate to form a large replica- 
tive/transcription complex (RTC) with yet unknown cellular factors. 
The RTC not only entails the synthesis of new viral genomes from 
a full-length negative-stranded template, but also a nested set of 
so-called subgenomic (sg) MRNAs encoding structural and acces- 
sory proteins (Fig. 1) (Pasternak et al., 2006). Coronavirus genome 
encodes for a total of 16 nsps, many having predicted and/or exper- 
imentally established function, and some others with yet defined 
functions. 

Among them, the RNA-dependent RNA polymerase (RdRp) 
encoded by nsp12 plays a central role in both RNA replication and 
transcription of the viral RNA. In general, RNA virus RdRps deal with 
genomes spanning from 5 to 15kb in length. Coronavirus RdRps 
stand out as they not only replicate large RNA molecules (up to 
32 kb) but also transcribe the nested set of sg mRNAs. Coronaviruses 
also possess other activities typically found in (+) RNA viruses, such 
as helicase and capping enzymes. On the other hand, less common 
or even unique RNA-processing activities are associated with other 
nsps, such as two nuclease activities: a 3’-5’ Exonuclease (ExoN) in 
the amino-terminal domain of nsp14 (Minskaia et al., 2006), and an 
endoribonuclease (NendoU) in nsp15 (Ivanov et al., 2004a). While 
the role of the former begins to be well established in proofreading, 
the role of the latter is still unclear. 

Coronaviruses also encode for another group of unique and 
interesting proteins: nsp7 to nsp10, which are all small proteins 
with no known protein homolog. Initially, they were all described 
as RNA binding proteins (Egloff et al., 2004; Su et al., 2006; Zhai 
et al., 2005). In more recent works, three of them (nsp7, nsp8 and 
nsp10) were shown to act as activating co-factors of the polymerase 
(nsp7, nsp8), the 3’-5’ ExoN (nsp10) and the 2’O-methyltransferase 
(MTase) (nsp10) activities. 

In this review, we present the current state of knowledge on 
SARS-CoV enzymes involved in RNA synthesis, proofreading, and 


capping. 


2. Formation of the replicative/transcription complex (RTC) 


In coronaviruses, replication refers to viral genome amplifica- 
tion, while transcription refers to the synthesis of nested set of sg 
mRNAs. Both processes are mediated by the RTC, which is located in 
a membranous network, called reticulo-vesicular network, which 
arises from the rearrangements of endoplasmic reticulum (ER) 
membranes (Knoops et al., 2008). Nsp3, through its transmembrane 
domain and together with two transmembrane proteins (nsp4 and 


nsp6) contribute to the anchoring of the RTC to membranes. It may 
be formed after viral replication cycles in order to hide replica- 
tion intermediates (dsRNA) from innate immunity sensors (Knoops 
et al., 2008). 


3. The replication/transcription complex catalytic core 


As for all (+) RNA viruses, the replication catalytic core con- 
tains at least a helicase and an RNA-dependent RNA polymerase 
(RdRp). 


3.1. The NTPase/helicase nsp13 


A helicase domain of superfamily 1 helicases is harbored in 
SARS-CoV nsp13 C-terminal half (Ivanov et al., 2004b). SARS-CoV 
nsp13 is able to unwind dsRNA (and dsDNA) in a 5’-3’ direction 
and required energy obtained from the hydrolysis of any NTPs 
and dNTPs (Ivanov et al., 2004b; Lee et al., 2010; Tanner et al., 
2003). The N-terminal region of nsp13 contains a zinc-binding 
domain forms of 12 conserved cysteine and histidine residues. This 
domain is conserved in all nidoviruses and is critical for the human 
CoV-229E (HCoV-229E) helicase activity in vitro (Seybert et al., 
2005). In order to promote dsRNA unwinding, an RNA substrate 
of at least five nucleotides in length is required and an estimated 
unwinding rate of ~280 base-pairs per second has been calculated, 
which is similar to other helicases (Adedeji et al., 2012). Interest- 
ingly, nsp13-mediated helicase activity has been reported to be 
stimulated 2-fold by nsp12 RdRp through direct protein/protein 
interaction (Adedeji et al., 2012). 


3.2. Nsp12 RdRp and its cofactors nsp7 and nsp8: a processive 
polymerase complex 


Nsp12 is an RNA-dependent RNA polymerase (RdRp) containing 
all conserved motifs (named with letters from A to F) of canonical 
RdRps (Poch et al., 1989). The polymerase active site (Ser-Asp-Asp 
within motif C) is conserved in all nidoviruses. Nsp12 also carries 
the motif G (Gorbalenya et al., 2002), which is a signature sequence 
of RdRps that initiate RNA synthesis in a primer-dependent man- 
ner (Te Velthuis et al., 2010). The in vitro nsp12 polymerase activity 
appeared weak and non processive (Cheng et al., 2005; Te Velthuis 
et al., 2010), at odds with the efficient replication of the very large 
RNA genome in vivo. Protein/protein interaction maps were estab- 
lished for SARS-CoV and among others, the nsp12/nsp8 interaction 
was recovered (Imbert et al., 2008). The interaction between SARS- 
CoV nsp7 and nsp8 is also ascertained as they were co-crystallized 
and proposed to act as processivity factor for nsp12 (Zhai et al., 
2005). 

In our lab, we tested this hypothesis and detected activation 
of the primer-dependent polymerase activity of nsp12 upon addi- 
tion of nsp7 and nsp8 (Subissi et al., 2014b). This nsp7/nsp8/nsp12 
complex is also able to synthesize RNA de novo in a processive fash- 
ion at least up to 340 nucleotides. To better understand how nsp7 
and nsp8 activate and confer processivity to the RNA-synthesizing 
activity of nsp12, we used biochemical assays and a reverse genet- 
ics approach. We identified three amino acids in SARS-CoV nsp7 
(K7, H36, and N37) that confer RNA-binding properties to the poly- 
merase complex and that are also important for virus replication 
in vivo. Our study also pointed out three nsp8 residues (K58, P183, 
and R190) critical for SARS-CoV genome replication. The first one 
is involved in the interaction of the polymerase complex with RNA, 
while the two others are essential for nsp8/nsp12 interaction. 
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Fig. 1. Coronavirus genome replication and transcription. The two open reading frames (ORFs) of the genomic mRNA are translated by a cap-dependent mechanism, yielding 
two polyproteins, ppla and pp1ab, the latter requiring —1 ribosomal frameshifting (FS) near the 3’ end of ORF1la. Subsequently, the two polyproteins are cleaved by two 
or three ORF1a-encoded proteases to release a total of 16 nonstructural proteins (nsp1—nsp16). Then, they assemble with modified intracellular membranes (Knoops et al., 
2008) to form the replication/transcription complex (RTC). This huge RTC not only entails the synthesis of new genomes, but also produces a nested set of subgenomic (sg) 
mRNAs encoding structural and accessory proteins. The gray ball represents the cap structure. Nonstructural, structural and accessory genes are represented in blue, green 


and purple, respectively. 


3.3. Nsp7 and nsp8, priming and/or processivity factors for 
nsp12? 


Nsp8 of SARS-CoV, feline CoV (FCoV) and HCoV-229E were pre- 
viously shown to exhibit RNA polymerase activities (Imbert et al., 
2006; Te Velthuis et al., 2010; Xiao et al., 2012). Moreover, the nsp8 
N-terminal amino-acid composition appears to be important for 
this activity. Indeed, SARS-CoV nsp8 with an N-terminal His tag can 
synthesize only short oligonucleotides (<6 nt) (Imbert et al., 2008), 
while SARS-CoV and FCoV nsp8 without this tag are able to synthe- 
size much longer RNA (Te Velthuis et al., 2010; Xiao et al., 2012). 
Upon addition of nsp7 to these “improved” recombinant nsp8, the 
RNA polymerase activity is stimulated (Te Velthuis et al., 2010; Xiao 
et al., 2012). Notably, the processing intermediates nsp7-nsp10 of 
HCoV-229E is also able to synthesize oligoribonucleotides (Xiao 
et al., 2012). 

Since nsp12 likely synthesizes RNA in a primer-dependent man- 
ner (Gorbalenya et al., 2002), nsp8 was proposed to act as a primase 
for nsp12. Moreover, SARS-CoV nsp8 crystal structure has been 
solved in complex with nsp7, showing a hexadecameric struc- 
ture formed by eight molecules of nsp7 associated with eight nsp8 
molecules (Zhai et al., 2005). This structure is reminiscent of pro- 
cessivity factors such as the eukaryotic PCNA sliding clamp (Prelich 
et al., 1987), and is able to accommodate a dsRNA in its central 
channel (Fig. 2A) (Zhai et al., 2005). 

However, when the nsp8 primase was assayed in the context of 
the highly active polymerase complex (nsp7/nsp8/nsp12), no short 
RNAs could be detected (Subissi et al., 2014b). A hypothesis of this 


discrepancy could be that nsp8 in complex with nsp7 and nsp12 
adopted a structural conformation that prevented the synthesis 
of short RNAs. In other words, nsp7/nsp8 may synthesize primers 
before associating with nsp12, or nsp7/nsp8-mediated primer 
synthesis may trigger the formation of the nsp7/nsp8/nsp12 poly- 
merase complex. This possibility is to be examined in light of the 
tridimensional structure solved for nsp7 and nsp8 from FCoV (Xiao 
et al., 2012). In this structure, FCoV nsp7 and nsp8 assemble into a 
heterotrimer formed by one nsp8 molecule associated to two nsp7 
molecules, without any central channel (Fig. 2B). SARS-CoV and 
FCoV nsp8 monomer folds are the same, except the conformation 
of the N-terminal helical domain that differs due to rotation over 
one or perhaps two hinge regions in the long alpha-helix, which 
undergoes partial unfolding at these hinge regions (Fig. 2C) (Xiao 
et al., 2012; Zhai et al., 2005). 

Coronavirus nsp8 in association with nsp7 reveals a remarkable 
plasticity that may explain several functions triggered by the nature 
of their partnership and/or quaternary structure. 


3.4. The puzzling nsp9 RNA-binding protein 


Nsp9 is a small (13 kDa in SARS-CoV) protein unique to coro- 
navirus (Snijder et al., 2003). The two available crystal structures 
of the protein (Egloff et al., 2004; Sutton et al., 2004) reveal a 
novel fold made of an open six-stranded B-barrel and an additional 
a-helix that contains a GXXXG protein/protein interaction motif 
(Miknis et al., 2009) involved in dimer formation. Noteworthy, this 
dimerization is critical for virus replication (Miknis et al., 2009). 
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Fig. 2. Coronavirus nsp7/nsp8 structures. (A) Structure of the SARS-CoV nsp7/nsp8 hexadecamer supercomplex (from Zhai et al., 2005; PDB 2AHM). To the left of this panel, 
SARS-CoV nsp7 and the two conformations of nsp8 are colored in orange, cyan and dark blue, respectively. To the right of this panel, the surface is colored according to 
electrostatic potential (blue, positive charge; red, negative charge). (B) Structure of the FCoV nsp7/nsp8 heterotrimer (left panel): two nsp7 molecules (in green) are associated 
to one molecule of nsp8 (in pink) (from Xiao et al., 2012; PDB 3UBO). The right panel shows the surface colored according to electrostatic potential (blue, positive charge; 
red, negative charge). (C) C-terminal domain superimposition of the two SARS-CoV nsp8 forms (in cyan and dark blue) with FCoV nsp8 (in pink). The amino-terminal domain 
of nsp8 is very flexible, even in association with nsp7. Images were generated using PYMOL. 


In addition, SARS-CoV nsp9 binds ssRNA without sequence speci- 
ficity (Egloff et al., 2004), and has been shown to interact with nsp8 
(Sutton et al., 2004). The DNA replication catalytic core is formed, at 
least, by the association of a polymerase and its processivity factor, 
a helicase, and a single-strand DNA-binding protein (SSB). In the 
case of SARS-CoV, the first three actors were identified in nsp12, 
nsp8/nsp7 and nsp13, respectively. Regarding the nsp9 structural 
features, we can hypothesis that this protein may act as an SSB-like 
molecule thus protecting the CoV RNA genome from degradation 
during replication. 


4. Is the 3’-5’ exonuclease nsp14 an RNA proofreading 
enzyme? 


The study of the coronavirus RNA repair machinery began with 
the discovery, by A.E. Gorbalenya and co-workers of nuclease motifs 
in ORF1b of the coronavirus genome. The discovery was pub- 
lished as part of the full analysis of the SARS-CoV genome (Snijder 
et al., 2003). Amongst other RNA processing enzymes, the authors 
reported that nsp14 N-terminal domain is an RNA 3’-5’ exonucle- 
ase (ExON) from the DEDD superfamily (Zuo and Deutscher, 2001). 


They proposed the ExoN activity to be related to RNA proofread- 
ing, repair, and/or recombination. These hypotheses were based on 
DNA organisms, which have developed mechanisms for their large 
DNA genome biosynthesis and maintenance. 

Ubiquitously spread in large-genome viruses, genes encoding 
homologs of coronavirus nsp14 EXON are found in all nidoviruses 
(toro and roniviruses, with respectively 28 and 26kb genomes), 
and mesoniviruses (a recently discovered related viral fam- 
ily with a 20kb genome) (Zirkel et al., 2011). Small genome 
nidoviruses (arteriviruses, 13-15kb) do not possess such ExoN 
activity (reviewed in, Gorbalenya et al., 2006). Outside nidoviruses, 
nsp14 ExoN homologs are rare, and have been solely identified 
in arenaviruses so far. In this last case, the ExON activity is likely 
involved in counteracting the antiviral innate immunity (Hastie 
et al., 2011). 

The seminal discovery of the CoV 3’-5’ ExoN activity was fol- 
lowed by sequential findings confirming its validity and raised 
great interest in various fields such as evolution, drug design and 
drug resistance. The 3’-5’ ExoN enzyme activity was subsequently 
demonstrated at the enzyme level (Minskaia et al., 2006). The fol- 
lowing most compelling demonstration was achieved by Eckerle 
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Fig. 3. Structure of 5’-end RNA cap. Cap structure is formed of a N7-methylated guanosine linked to the first RNA nucleotide via a 5’-5’ triphosphate bond. A second methyl 
group is added in 2’0 position of the first nucleotide (for SARS-CoV, the first nucleotide is an adenosine). 


et al. (2007). These authors showed that mutations in the DEDD 
motif, usually leading to the inactivation of the exonuclease activ- 
ity, allowed recovering viable viruses that exhibit a 15-20-fold 
increase in replication errors. The study was conducted in MHV, 
a betacoronavirus-like SARS-CoV. Several significant complements 
and refinements of these studies appeared subsequently (Eckerle 
et al., 2010). Whereas less than one mutation per sequenced 
genome was observed for wild-type SARS-CoV (corresponding to 
an error frequency of 2 x 10->), the SARS-CoV ExoN(—) showed 
10 mutations per sequenced genome (error frequency =3 x 1074) 
(Eckerle et al., 2010). The same group recently showed that coron- 
aviruses lacking exoribonuclease activity are susceptible to lethal 
mutagenesis (Smith et al., 2013). Indeed, they demonstrated that 
5-fluorouracil (5FU) is incorporated into SARS-CoV and MHV RNA 
genomes, driving increased mutagenesis in ExoN(—) viruses (Smith 
et al., 2013). In addition, ribavirin, a guanosine analog, was also 
tested ona MHV ExoN(— ) strain exhibiting up to 300-fold increased 
in sensitivity to this compound. 

At the enzyme level, mechanistic studies showed that nsp14 
EXON activity is strongly enhanced (>35-fold) by the addition of 
nsp10 (Bouvet et al., 2012). The nsp10/nsp14 ExoN complex was 
shown to be dsRNA-specific, but also able to excise one 3’ mis- 
matched nucleotide mimicking a polymerase-mediated nucleotide 
misincorporation product (Bouvet et al., 2012). It is however 
unclear which enzyme state, nsp10/14 complex or nsp14 alone is 
involved in the aforementioned effects on RNA replication errors. 

Evidences accumulate in favor of an RNA proofreading sys- 
tem versus a post-replicative mismatch repair. The latter generally 
involves a strand recognition system (e.g., neosynthesized ssRNA 
methylation) to determine which nucleotide has to be corrected in 
a mismatch-bearing dsRNA. Then, a succession of enzymatic activ- 
ities are required to replace the mismatched nucleotide: dsRNA 
endonucleolytic cleavage, RNA re-synthesis, and RNA ligation. 
Although several of these activities are embedded in the corona- 
virus ORF1ab, both strand recognition and ligase activities remain 
to be determined. In contrast, RNA proofreading generally involves 
a minimal set of activities of the polymerase dealing with the mis- 
matched nucleotide and, a single 3’-5’ exonuclease activity vicinal 
to the active RNA polymerase. 

To formally demonstrate the occurrence of RNA proofreading 
by nsp14, several questions have to be addressed, some of which 
have already been answered. First, nsp14 has to be made available 


to nsp12, and both should exhibit activities. This has been demon- 
strated in Subissi et al. (2014b), with the association of nsp12 and 
nsp14 into an active complex. Second, the RNA mismatch clearance 
pathway should be reconstituted in vitro. That is, not only mismatch 
formation by nsp12 should be documented (nature and frequency), 
but also mismatch extension vs mismatch excision, yielding an esti- 
mate of the mismatch fixation into the RNA genome. 


5. Coronavirus cap formation pathway 


Eukaryotic messenger RNA translation can be initiated by 2 dif- 
ferent signals at the 5’-end, either by a cap structure or by a highly 
specific RNA tertiary structure named internal ribosome entry site 
(IRES). 

Nidovirus genomes possess a cap structure at their 5’-end (Lai 
and Stohlman, 1981; Van Vliet et al., 2002). The cap is com- 
posed of a methylated guanosine at position N7 and linked to 
the first transcribed nucleotide of the RNA by a 5/-5’ triphos- 
phate bridge, called “cap-0” (7“GpppN) (Fig. 3). A “cap-0” structure 
can then be converted into “cap-1” (’“Gppp,,2N) by the addition 
of a methyl group to the 2’ position of the first RNA nucleotide 
(Furuichi and Shatkin, 2000) (Fig. 3). The cap structure is then rec- 
ognized by the eukaryotic initiation factor 4E (eIF4E), which, in 
association with other elFs, recruit the small ribosomal subunit 
AOS (for review, Decroly et al., 2012). Another cap function was 
described as protecting the mRNA from cellular 5’-3’ ExoN degra- 
dation (Furuichi et al., 1977). Recently, it has been shown that the 
2'-O-methylation of the viral cap prevents viral genome recogni- 
tion by the innate immune system of the host cell (Ztist et al., 
2011). 

Generally, cap formation requires three successive steps (Fig. 4) 
(Furuichi et al., 1976): (1) an RNA 5’-triphosphatase (RTPase) 
hydrolyses the 5’ y-phosphate of the nascent mRNA; (ii) a 
guanylyltransferase (GTase) catalyses the transfer of a guanosine 
monophosphate (GMP) to the diphosphate MRNA 5’ end; (iii) an 
RNA methyltransferase (MTase) adds a methyl group at the N7 
position of the cap guanine. As previously mentioned, CoV cap is a 
“cap-1”, and thus another MTase transfers a methyl group to the 2’0 
position of the first mRNA nucleotide and this enzyme is so called 
2'-O-MTase. As already mentioned, RNA cap methylation is a very 
important process, “hiding” the viral MRNA from innate immunity 
sensors but also preventing the pyrophosphorolytic reversal of the 
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Fig. 4. SARS-CoV RNA capping pathway. SARS-CoV uses a canonical capping pathway. The 5’-end y-phosphate of the RNA is hydrolyzed by the nsp13 RTPase activity. Then, 
the still unknown RNA GTase transfers a GMP to the 5’-diphosphate RNA end, forming a 5’-5’ triphosphate bond. Next, the nsp14 N7-MTase transfers a methyl group from 
the SAM donor to the N7-position of the cap, forming the “cap-O” structure. Lastly, the nsp16/nsp10 2’O-MTase methylates the first RNA nucleotide in the 2’0 position, to 


form the “cap-1” structure. 


guanylyltransfer reaction (Furuichi et al., 1977; Shuman, 2001). 
Knocking-out cap methylation genes in SARS-CoV is lethal if not 
detrimental for virus replication (Almazan et al., 2006; Chen et al., 
2009). 

SARS-CoV replicates in the cytoplasm, without access to the 
nuclear host-cell machinery for mRNA capping. Therefore, SARS- 
CoV has two options for capping its mRNAs: either the virus can 
hijack the cellular machinery or it has to produce its own RNA- 
Capping enzymes (Mao and Shuman, 1994). So far, out of the four 
necessary activities involved in MRNA capping, three were identi- 
fied in the CoV ORF1b (Figs. 1 and 4) (Subissi et al., 2014a). Current 
knowledge regarding the 4 steps of cap formation is more fully 
described in Sections 5.1—5.4. Unless otherwise specified, all data 
were shown for the SARS-CoV. 


5.1. The RNA 5’-triphosphatase (RTPase ) 


RTPase catalyses the cleavage of the interphosphate bond 
between the B- and the y-phosphates at the 5’-end of a triphos- 
phorylated mRNA. Biochemical assays indicate that this function, 
together with a helicase activity might be played by nsp13 (as 
described in paragraph 3.1) (Ivanov et al., 2004b). The combina- 
tion of RTPase and helicase activities carried by the same protein 
has already been observed for many RNA viruses (Wengler and 
Wengler, 1993; Bisaillon et al., 1997; Vasiljeva et al., 2000; Li et al., 
2001; Benarroch et al., 2004b). Nevertheless, the exact RTPase 
mechanism as well as the high-resolution structure of nsp13 are 
still unknown. 


5.2. The guanylyltransferase (GTase) 


The guanylyltransferase (GTase) enzyme allows the transfer of 
GMP to the 5’-diphosphate mRNA end (Fig. 4) (Roth and Hurwitz, 
1984). Most GTase proteins were shown to create covalent links 
with GMP before its subsequent transfer onto the diphosphate end 
of the RNA (Ahola and Ahlquist, 1999). 

Many efforts have been made to identify the SARS-CoV GTase, 
unsuccessfully so far. No GMP covalent link was found for nsp7, 
nsp8, nsp10/nsp16, nsp12, nsp13, nsp14 or nsp15 (Jin et al., 2013). 
Yeast trans-complementation experiments, that had allowed dis- 
covery of the N7-MTase activity (see Section 5.3), did not allow the 
identification of the SARS-CoV GTase activity (Chen et al., 2009). 
This function remains elusive. 


5.3. The N7-methyltransferase (N7-MTase) 


The methylation of the mRNA at the N7 position of the cap is 
the third step of cap formation and leads to a “cap-0” (7™GpppN) 


(Fig. 3). N7-MTases are S-adenosyl-L-methionine (SAM) depend- 
ent enzymes. A methyl group is transferred from the SAM to the 
cap-RNA, producing also S-adenosyl-homocysteine (SAH) as by- 
product. 

Functional screenings in yeast have revealed that SARS-CoV 
nsp14 is able to complement the deletion of the cellular cap 
N7-MTase (Chen et al., 2009). Furthermore, this activity was 
demonstrated by biochemical analysis. Nsp14 is able to methy- 
late the N7 position in presence of SAM without any sequence 
specificity (Chen et al., 2009; Bouvet et al., 2010). Indeed, nsp14 
N7-MTase is active on different types of substrates: GTP and dGTP 
as well as cap analogs (GpppG, GpppA and ™’GpppG) (Jin et al., 
2013). By sequence alignment and comparison of conserved motifs, 
the N7-MTase activity was attributed to the C-terminal domain of 
nsp14. Biochemical analysis revealed that both nsp14 enzymatic 
activities (EXON and N7-MTase) are functionally distinct. However, 
based on deletion mutation analysis the two functional domains 
seem structurally linked (Chen et al., 2009) and a large region of 
nsp14(aa 62 to 527) is essential for both N7-MTase and ExOoN activi- 
ties. The N7-MTase activity is strongly dependent on the N-terminal 
domain. However, the N-terminal ExoN domain is not involved in 
substrate binding for the N7-MTase activity (Chen et al., 2013). 
Nsp10, which was shown to strongly enhance the nsp14 ExoN 
activity (Bouvet et al., 2012), has no effect on the nsp14 N7-MTase 
activity (Bouvet et al., 2010). 

The most conserved motif among N7-MTases is the SAM- 
binding site with a specific sequence signature “D/ExGxGxG” 
(where x refers to any amino acid) (Martin and McMillan, 2002). 
According to CoV nsp14 sequence alignments, the motif DxGx- 
PxG/A (aa 331 to 337) may represent the SAM-binding site of the 
nsp14 N7-MTase (Chen et al., 2009; Bouvet et al., 2010). Moreover, 
two more regions, the first ranging from aa 310 to 428, overlapping 
with the core domain of N7-MTase, and the second ranging from aa 
73 to 86, located at the N-terminus of the ExoN domain were found 
to be essential for nsp14 N7-MTase activity (Chen et al., 2013). 

The SAM-dependent MTase family exhibits low sequence con- 
servation but possesses a highly conserved structural fold, named 
Rossmann-like fold (Martin and McMillan, 2002; Schubert et al., 
2003). This fold is an alternation of B-stranded and a-helical 
regions, with 7 strands forming a central relatively planar B-sheet 
and a varying number of helices creating two layers, one on each 
side of the plane (Martin and McMillan, 2002). Based on the known 
structure of mRNA cap N7-MTase, a structural model of SARS-CoV 
nsp14 core N7-MTase was created (Chen et al., 2013). This model 
confirmed the implication of DxGxPxG/A motif in SAM binding. 
Nevertheless, the determination of the full-length nsp14 structure 
is required to better understand the functional mechanism of the 
N7-MTase activity. 
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5.4. The 2’-O-methyltransferase (2'-O-MTase ) 


A 2’-O-MTase transfers a methyl group from the SAM methyl 
donor to the first mRNA nucleotide. This step converts the “cap-0” 
to a “cap-1” structure (’“Gppp,,2N), this last form being found in 
all higher eukaryotes (Fig. 3). Lower eukaryotes, including yeast, 
employ usually a “cap-0” (““GpppN) structure. Many RNA viruses 
have evolved to adopt the “cap-1” structure, yielding to “cellular- 
like mRNA”. Zist et al. (2011) highlighted a biological role for 
viral MRNA 2’-O-methylation. They showed that 2’-O-methylation 
subverts the induction of type | interferon, acting as a molecular 
signature to distinguish self and non-self mRNA. 

2'-O-MTases belong to the same family as N7-MTase, with low 
sequence identity but acommon Rossmann-like structural fold (see 
section 5.3) encompassing the canonical catalytic tetrad K-D-K- 
E (Bujnicki and Rychlewski, 2001; Ferron et al., 2002; Benarroch 
et al., 2004a). The functional mechanism of the vaccinia virus (vv) 
2’-O-MTase VP39 has been extensively described (Hodel et al., 
1996; Lockless et al., 1998). The reaction mechanism seems to 
involve a bi-molecular nucleophilic substitution (Sy2 type) in 
which the nucleophilic ribose 2'-hydroxyl group of a lysine attacks 
the SAM methyl group, thus leading to the release of SAH (Li et al., 
2004). 

Bioinformatics studies on the SARS-CoV have predicted a 
SAM-dependent 2’-O-MTase harbored by nsp16 (Von Grotthuss 
et al., 2003). This activity was confirmed in vitro for FCoV nsp16 
(Decroly et al., 2008) and, with a lower activity for SARS-CoV 
nsp16. Bouvet et al. (2010) showed evidences that SARS-CoV 
nsp16 2’-O-MTase activity is stimulated by the addition of nsp10 
through direct protein/protein interaction. They found out that the 
nsp10/nsp16 complex can methylate the 2’0 position of the first 
mRNA nucleotide only if the cap is already methylated at the N7 
position. Using a mutational approach, K4g-D139-K179-E2093 residues 
were identified as the catalytic tetrad (Fig. 5). The 2’-O-MTase activ- 
ity is completely abolished by any single mutation of this catalytic 
tetrad, demonstrating that although nsp10 stimulates the nsp16 2’- 
O-MTase activity, the catalytic site itself resides in nsp16 (Bouvet 
et al., 2010). Noteworthy, a similar activation of MTase activity 
was previously reported for the vaccinia virus capping enzyme 
(Mao and Shuman, 1994). The structure of the co-factor nsp10 was 
first solved and revealed a novel fold with a pair of antiparallel N- 
terminal a-helices stacked against an irregular 8-sheet, a coil-rich 
C-terminus and two Zn fingers (Joseph et al., 2006; Su et al., 2006). 
The crystal structure of SARS-CoV nsp16 in complex with nsp10 was 
determined in 2011 by 2 groups (Fig. 5) (Chen et al., 2011; Decroly 
et al., 2011). The nsp10 overall structure in complex with nsp16 
remains quite similar to the one determined for the free nsp10. 
Nsp16 adopts a canonical SAM-dependent MTase fold with two 
binding domains, one for the methyl donor SAM and another for the 
methyl acceptor substrate (capped-RNA). Nsp16 fold is a central B- 
sheet of 7 strands surrounded by five a-helices. Compared to the 
MTase canonical fold, two a-helices are absent in the SAM binding 
region of nsp16, but the complex formed with nsp10 seems to com- 
pensate and stabilize the overall structure of nsp16. Whereas earlier 
reports claimed that Mg2* was necessary for the nsp10/nsp16 2’-O- 
MTase activity (Bouvet et al., 2010), no cation was observed in the 
active site. However, the structure shows a Mg?* located far from 
the SAM-binding site (Fig. 5). Itis thus possible that divalent cations 
play a structural role in the nsp16 structure in stabilizing the gen- 
eral fold. The organization of the active site is in agreement with 
the current 2'-O-MTase mechanism described above, transferring 
the methyl group of SAM to capped-RNA. Although no RNA-bound 
structure is available to date, a putative RNA-binding groove was 
proposed based on the highly positively charged patch found on 
nsp16 at the proximity of the catalytic site (Fig. 6) (Chen et al., 
2011; Decroly et al., 2011). 


nsp16 


Fig. 5. Crystal structure of SARS-CoV nsp10/nsp16 complex. Ribbon representation 
of nsp10 (in gray) in complex with nsp16 (in yellow) (from Decroly et al., 2011; 
PDB 2XYQ). Mg?* ion (in brown) found on nsp16 is localized at the opposite side of 
the 2’O-MTase active site which is formed by the catalytic tetrad K46, D130, K170 
and E203 (in pink). The by-product of the reaction, SAH in cyan is localized near the 
catalytic site. The two Zn2* ions (orange) on nsp10 are not involved in the interaction 
with nsp16. 


Nsp10 alone does not exhibit any enzyme activity but appears 
to be central for two distinct regulatory mechanisms (nsp14 ExoN 
and nsp16 2’-O-MTase). Our group has determined the interaction 
surface of nsp10 with nsp14 and with nsp16 (Bouvet et al., 2014; 
Lugari et al., 2010). These studies show that the same nsp10 inter- 
face is involved in complex formation with both nsp16 and nsp14. 


Sinefungin & 
or 


SAH 


Fig. 6. Sinefungin binding site and hypothetical RNA binding groove on SARS-CoV 
nsp16. Electrostatic potential surface of nsp16 presented with SAH (in cyan) and 
sinefungin (in yellow), which are localized in the same pocket. A putative binding 
groove for a capped-mRNA is indicated. 


Source: From Decroly et al. (2011) and Chen et al. (2011); PDB codes 2XYQ and 2XYV. 
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nsp10 


nsp16 


Fig. 7. Comparison of SARS-CoV nsp10 interaction surface with nsp14 and with 
nsp16. The SARS-CoV nsp10/nsp16 complex structure is presented with nsp10 
solvent-accessible surface in gray, and nsp16 in yellow ribbon. Nsp10 residues 
involved in the interaction with nsp14 are depicted in green. The interaction surface 
of nsp10 with nsp16 and nsp14 overlaps, the latter being wider (Bouvet et al., 2014). 


However, in the case of nsp10/nsp14, the interaction interface is 
substantially wider (Fig. 7). As a consequence, nsp10 cannot inter- 
act simultaneously with both nsp14 and nsp16. Nevertheless, the 
two regulatory functions of nsp10 may still act simultaneously, 
since nsp10 is translated in stoichiometric excess compared to 
ORF1b-encoded nsp14 and nsp16 (Fig. 1) (Snijder et al., 2003). 

Eukaryotic and viral mRNA capping systems show many dif- 
ferences (Furuichi and Shatkin, 2000), and given the essential role 
fulfilled by these systems, each protein involved in this process 
should be considered as a potential antiviral target. In particu- 
lar, viral MTases seem to be good candidates (Dong et al., 2008; 
Bollati et al., 2010; Dong et al., 2010). Indeed, mutations abolish- 
ing MTase activities have a detrimental effect on viral replication 
(Almazan et al., 2006; Ray et al., 2006; Chen et al., 2009). In vitro, 
sinefungin and SAH (the by-product of the methyl transfer reac- 
tion) were found to inhibit both SARS-CoV MTase activities (Bouvet 
et al., 2010). Furthermore, a crystal structure of the SARS-CoV 
nsp10/nsp16 complex with sinefungin was solved and revealed 
that the molecule is bound in the SAM-binding pocket and hence 
should inhibit nsp16 2’-O-MTase activity (Fig. 6). In addition, aurin- 
tricarboxylic acid (ATA), expected to bind MTase active site, inhibits 
SARS-CoV replication (He et al., 2004; Milani et al., 2009). Biochem- 
ical assays demonstrated that ATA inhibits both nsp14 N7-MTase 
and nsp16 2’-O-MTase activities (Bouvet et al., 2010). Recently, Sun 
et al. (2014) developed a yeast-based high-throughput antiviral 
Screening system to find novel anti-coronavirus drugs. This study 
confirmed that sinefungin significantly suppresses the growth of 
yeast cells complemented with nsp14 N7-MTase gene. However, 
sinefungin is not a specific inhibitor as it also inhibits yeast and 
human MTases (Sun et al., 2014). 


6. Conclusion 


Coronaviruses have only recently attracted a wider audience 
due to public health issues generated by SARS- and MERS-CoV 
emergences. However, remarkable achievements have been made 
regarding the elucidation of structures, functions, and mechanisms 
of what appears to be the most sophisticated and regulated (+) RNA 
genome replication/transcription machinery so far. Although other 
viruses, such as Hepatitis C and Influenza viruses, might be bet- 
ter and sooner controlled through antiviral therapies, the study of 
coronaviruses may well accelerate our current understanding of 
RNA synthesis, evolution, capping, repair, as well as all novel impli- 
cations arising from the combination and coordination of these RNA 
processing activities. Further experiments should open an entirely 
new field of research, since for viral RNA synthesis, very few data 
are available on these processes. For example, it is possible to quan- 
titate mismatch formation for some RNA viruses (Hepatitis C Virus 
(Powdrill et al., 2011), Poliovirus (Korneeva and Cameron, 2007), 
Dengue virus (Jin et al., 2011)) and sometimes only for a restricted 
set of nucleotides (Dengue virus (Jin et al., 2011)). 

The progress made on SARS-CoV RNA synthesis machinery 
comprehension will soon allow to measure, through genome 
sequencing, the RNA replication fidelity. This knowledge will likely 
open a Pandora box containing the coronavirus “playbook” expos- 
ing how they maintain the integrity of their large genomes, while 
generating the population diversity required for emergence and 
adaptation. Knowing in detail these mechanisms will provide new 
antiviral strategies. 
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